Classical capacity of the lossy bosonic channel: the exact solution 
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The classical capacity of the lossy bosonic channel is calculated exactly. It is shown that its 
Holevo information is not superadditive, and that a coherent-state encoding achieves capacity. The 
capacity of far-field, free-space optical communications is given as an example. 
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A principal goal of quantum information theory is eval- 
uating the information capacities of important commu- 
nication channels. At present — despite the many efforts 
that have been devoted to this endeavor and the theo- 
retical advances they have produced Q — exact capacity 
results are known for only a handful of channels. In this 
paper we consider the lossy bosonic channel, and we de- 
velop an exact result for its classical capacity C, i.e., 
the number of bits that it can communicate reliably per 
channel use. The lossy bosonic channel consists of a col- 
lection of bosonic modes that lose energy en route from 
the transmitter to the receiver. Typical examples are free 
space or optical fiber transmission, in which photons are 
employed to convey the information. The classical capac- 
ity of the lossless bosonic channel — whose transmitted 
states arrive undisturbed at the receiver — was derived in 

0, 0. When there is loss, however, the received state 
is in general different from the transmitted state, and 
quantum mechanics requires that there be an accompa- 
nying quantum noise source. In a first step toward the 
capacity of such channels was given by considering only 
separable encoding procedures. Here, on the contrary, 
it is proven that the optimal encoding is indeed separa- 
ble. We obtain the value of C in the presence of loss 
when the quantum noise source is in the vacuum state, 

1. e., when it injects the minimum amount of noise into 
the receiver. Our derivation proceeds by developing an 
upper bound for C and then showing that this bound 
coincides with the lower bound on C reported in 0, [|| . 
Our upper bound results from comparing the capacity of 
the lossy channel to that of the lossless channel whose 
average input energy matches the average output energy 
constraint for the lossy case 0. This argument is anal- 
ogous to the derivation of the classical capacity of the 
erasure channel The lower bound comes from cal- 
culating the Holevo information for appropriately coded 
coherent-state inputs. Thus, because the two bounds co- 
incide, we not only have the capacity of the lossy bosonic 
channel, but we also know that capacity can be achieved 
by transmitting coherent states. 



Classical capacity.- The classical capacity of a chan- 
nel can be expressed in terms of the Holevo information 



x(Pj ,<?j) = S(£2 Pj a j)~Yl Pj S iP'i 
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(1) 



where pj are probabilities, Uj are density operators and 
S(g) = — Tr[g>log 2 g] is the von Neumann entropy. Since 
it is not known if x is additive, C must be calculated by 
maximizing the Holevo information over successive uses 
of the channel, so that C = sup„(C n /n) with 



p.,- 



(2) 



where the states <jj live in the Hilbert space 7i®" of n 
successive uses of the channel and Af is the completely 
positive map that describes the channel [tj. In our case, 
H is the Hilbert space associated with the bosonic modes 
used in the communication and Af is the loss map. Be- 
cause TL is infinite dimensional, C n diverges unless the 
maximization in Eq. (01 is constrained: here we assume 
that the mean energy of the input state in each of the n 
realizations of the channel is a fixed quantity £. For mul- 
timode bosonic channels, Af is given by ® fe A4, where 
Afk is the loss map for the fcth mode, which can be ob- 
tained, tracing away the vacuum noise mode bk, from the 
Heisenberg evolution 
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with afc and a' k being the annihilation operators of the 
input and output modes and < % < 1 is the mode 
transmissivity (quantum efficiency). 

The main result of this paper is that the capacity of 
the lossy bosonic channel, in bits per channel use, is 



C = max y2g(r] k N k ) , 

k 



(4) 



where g{x) = (x+ 1) log 2 (a; + 1) — x log 2 x and where the 
maximization is performed on the modal average photon- 
number sets {-/Vfc} that satisfy the energy constraint 



(5) 
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(ujk is the frequency of the fcth mode). 

We derive Eq. Q by giving coincident lower and up- 
per bounds for C. The right-hand side of Eq. Q was 
shown, in 0, to be a lower bound for C by generaliz- 
ing the narrowband analysis of [{|. This expression was 
obtained from Eq. J2J by calculating x for n = 1 under 
the following encoding: in every mode fc we use a mix- 
ture of coherent states \fj)k weighted with the Gaussian 
probability distribution 

Pfc (/i)=exp[-| M | 2 /iV fe ]/(7r7V fe ). (6) 

This corresponds to feeding the channel the input state 

e = (& d^pk(^) > ( 7 ) 
k J 

which is a thermal state that contains no entanglement 
or squeezing. The right-hand side of Eq. (0J is also an 
upper bound for C. To see that this is so, let pj, cfj 
be the optimal encoding on n uses of the channel, which 
gives the capacity C n of Eq. @ ■ The definition of x an d 
the subadditivity of the von Neumann entropy allow us 
to write 

n 

C n < S{M® n [a\) ^^S(^[ P f]) , (8) 

1=1 k 

where a = YljPj&j an d ■Mfc[f?fe ] is the reduced density 
operator of the fcth mode in the Ith realization of the 
channel, which is obtained from ]\[® n [a] by tracing over 
all the other modes and over the other n — 1 channel 
realizations. The first inequality in Eq. JSJl comes from 
bounding C n by the amount of information that can be 
transmitted through a lossless channel with input state 
Af® n [a], viz., the output of the lossy channel with opti- 
mal input state a Q ■ Now let be the average photon 
number for the state -fiV^I must satisfy the energy 
constraint JHJl for all I [llj] . Moreover, the loss will leave 
only r/kN^ photons, on average, in the corresponding 
output state Mk[o^]- This implies that 

SWklQ^XgiVkNjP), (9) 

where the inequality follows from the fact that the term 
on the right is the maximum entropy associated with 
states that have r/kN^ photons on average 0,0]. In- 
troducing Eq. into JSJ, we obtain the desired result 

n 

C " < E E 9(VkN { h l) ) ^ fimaxV; ff (TftJV fc ) , (10) 

1=1 k k 

where the maximization is performed over the sets {Nk} 
that satisfy Eq. Because Eq. (|10|) holds for any n, 
we conclude that the right-hand side of l@J is indeed also 
an upper bound for C. 



Discussion.- Some important consequences derive 
from our analysis. First, capacity is achieved by a single 
use of the channel (n = 1) employing random coding — 
factorized over the channel modes — on coherent states 
as shown in Eq. {7|). This means that, at least for 
this channel, entangled codewords are not necessary and 
that the Holevo information is not superadditive. Notice 
that the lossy bosonic channel can accommodate entan- 
glement among successive uses of the channel, as well 
as entanglement among different modes in each channel 
use. Surprisingly, neither of these two strategies is nec- 
essary to achieve capacity. Nor is it necessary to use any 
non-classical state, such as a photon number state or a 
squeezed state, to achieve capacity; classical (coherent 
state) light is all that is needed. Classical light suffices 
because the loss map M simply contracts coherent-state 
codewords in phase space toward the vacuum state. Co- 
herent states retain their purity in this process, and hence 
the non-positive part of the Holevo information — the sec- 
ond term of the right-hand side of Eq. Q — retains its 
maximum value of zero. Despite the preceding proper- 
ties, quantum effects are relevant to communication over 
the lossy bosonic channel. For example, our proof does 
not exclude the possibility of achieving capacity using 
quantum encodings, and such encodings may have lower 
error probabilities, for finite- length block codes, than 
those of the capacity-achieving coherent state encoding. 
This is certainly true for the lossless case. In particu- 
lar, it was already known that C can be achieved with a 
number-state alphabet 0, El ; our work shows that there 
is also a coherent-state encoding that achieves capacity 
for this case. [The two procedures employ the same aver- 
age input state, Eq. Q)]. However, the probability of the 
receiver confusing any two distinct finite-length number 
state codewords is zero in the lossless case, whereas it is 
positive for all pairs of finite-length coherent-state code- 
words. The lossless case also provides an example of the 
possible role of quantum effects at the receiver: the op- 
timal coherent-state system uses a classical transmitter, 
but its detection strategy, can be highly non-classical || . 
In contrast, the optimal number-state system for the loss- 
less channel requires a non-classical light source, but its 
receiver uses simple modal photon counting. 

How well can we approach this capacity using conven- 
tional decoding procedures? Using the coherent-state en- 
coding of Eq. J2J with either heterodyne or homodyne de- 
tection, the amount of information that can be reliably 
transmitted is 

7 = max^^log 2 (l + ?7fc 7V fc /e 2 ) , (11) 
h k 

where £ = 1/2 for homodyne and £ = 1 for hetero- 
dyne, and where, as usual, the maximization must be 
performed under the energy constraint (0). Equation 
f| 1 1 1) has been obtained by summing over fc the Shannon 
capacities for the appropriate detection procedure [3| . In 
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general I < C: heterodyne or homodyne detection can- 
not be used to achieve the capacity. However, heterodyne 
is asymptotically optimal in the limit of large numbers 
of photons in all modes, AT/. — > oo for all fc, because 
g(x) / \og 2 (x) — > 1 as x — ► oo. 

The capacity expression C can be simplified by us- 
ing standard variational techniques to perform the con- 
strained maximization in Eq. yielding [6j 



C = ^ g(r} k N k (fi)) 



(12) 



where Nk(/3) is the optimal photon number distribution 

1/% 



e f3huj k /r) k _ I 



(13) 



with f3 being a Lagrange multiplier that is determined 
through the constraint on average transmitted energy. 

In the following sections we calculate the capacities 
of some bosonic channels. The first two examples help 
clarify the derivation of Eq. the last is a realistic 
model of frequency-dependent lossy communication, on 
which we also evaluate the performance of homodyne and 
heterodyne detection. 

Narrowband channel.- Consider the narrowband 
channel in which a single mode of frequency u> is em- 
ployed. In this case, Eq. (jl2(l becomes 



(14) 



where N = £/(hu>) is the average photon number at the 
input. Equation Ijl4(l was conjectured in Q, where it 
was given as a lower bound on C. The following simple 
argument shows that g(rjN) is also an upper bound for 
C . Consider the lossless channel that employs r)N pho- 
tons on average per channel use. Its capacity is given 
by max e S(g), where the maximization is performed over 
input states g with mean energy £' — r/HuiN |l2l |. The 
maximum , co mputed through variational techniques, is 
g(r)N) 0, ^3 ■ The lossless channel cannot have a lower 
capacity than the lossy channel, because both have the 
same average received energy, and the set of receiver 
density operators achievable over the lossy channel is a 
proper subset of those achievable in the lossless system 
pj. This implies that g(r/N) is an also upper bound on 
C and hence equal to C. 

Frequency-independent loss.- Now consider a broad- 
band channel with uniform transmissivity, rjk = f], that 
employs a set of fre quen cies tau — fc Slu for fc 6 N. In this 
case, Eq. (tT2l gives [U] 



ln2V 3h 



(15) 



where T = 2ir/duj is the transmission time, and V = £ /T 
is the average transmitted power. Equation (|15|l was de- 
rived for the lossless case (t) = 1) in [2j and was shown to 



provide a lower bound on C in |(| . In order to show that 
the right-hand side of Eq. (fT~5f) is also an upper bound, 
consider the lossless broadband channel in which the av- 
erage input power is equal to rfP, viz., the average out- 
put power of the lossy channel. According to Q, the 
capacity of this channel is {\/ ~kt]P /3)T / In 2, which coin- 
cides with the right-hand side of Eq. JTSJ. The reasoning 
given above for the single-mode case now implies that the 
broadband lossless channel's capacity cannot be less than 
that of the broadband lossy channel, thus completing the 
proof. 
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FIG. 1: Capacities of the far-field free-space optical chan- 
nel as a function of the input power V (in the plot Vo = 
2-irhc 2 L 2 / (AtA r )) . The solid curve is the capacity C from 
Eq. I16L the other two curves are the capacities / from 
Eq. psp achievable with coherent states and heterodyne de- 
tection (dashed curve) or coherent states and homodyne de- 
tection (dotted curve). Note that the heterodyne detection / 
approaches the optimal capacity C in the high-power limit. 

Far-field, free-space optical communication.- Con- 
sider the free-space optical communication channel in 
which the transmitter and the receiver communicate 
through circular apertures of areas A t and A r that are 
separated by an L-m-long propagation path. At fre- 
quency u> there will only be a single spatial mode in the 
transmitter aperture that couples appreciable power to 
the receiver aperture when the Fresnel number D(u>) = 
A t A r (uj/2TrcL) 2 satisfies D(cu) < 1, 0. This is the far- 
field power transfer regime at frequency u), and D(uj) is 
the transmissivity achieved by the optimal spatial mode. 
A broadband far-field channel results when the transmit- 
ter and receiver use the optimal spatial modes at frequen- 
cies up to a critical frequency u Cl with D(lo c ) <C 1. In 
this case we use r/k = D(uik) in Eq. I|12|) . and the capacity 
C becomes lljj 



c _ r dx ( 1 



(16) 



where yo is a dimensionless parameter inversely propor- 
tional to the Lagrange multiplier /3, which is determined 
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from the power constraint 

_ 2Trhc 2 L 2 r Vo dx i 
p -^atJ ~^~\- (17) 

Although C is proportional to the maximum frequency 
u) c , this factor cannot be increased without bound, for 
fixed transmitter and receiver apertures, because of the 
far-field assumption. Figure^plots C versus V obtained 
from numerical evaluation of Eqs. (|16|) and l|17|). 
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FIG. 2: Power spectrum S = uikNk for the far-field free-space 
channel plotted versus frequency in the continuum regime 
. The solid curve is for optimal capacity, the dotted curve 
is for homodyne detection, and the dashed curve is for hetero- 
dyne detection. Here V/Vo ~ 3. In contrast to the frequency- 
independent lossy channel, all of these coherent-state encod- 
ings preferentially employ high frequencies instead of low fre- 
quencies. This marked change in spectral shaping is due to 
the transmissivity's having a quadratic dependence on ui. 

To compare the capacity of Eq. with the infor- 
mation transmitted using heterodyne or homodyne de- 
tection, we perform the Eq. maximization. The 
Lagrange multiplier technique gives the optimal value 
N k (P) = max{l/(/3^ fe ) - £ 2 /r) k , 0}, plotted in Fig.0 
[Notice that the non-negativity of this solution forbids 
the use of frequencies lower than uio = £ 2 f3htu 2 /D((jj c ).] 
With this photon number distribution, Eq. <|11|) becomes 

I = fr c T(l/y -l + lny )/(2irhi2) , (18) 

where yo is now determined from the condition V = 
£ 2 2irhc 2 L 2 (yo - 1 - lny )/(A r A s ). We have plotted / 
versus P in Fig. ^for heterodyne and homodyne detec- 
tion. At low power, the noise advantage of homodyne 
makes its capacity higher than that of heterodyne. At 
high power levels heterodyne prevails thanks to its band- 
width advantage, and its capacity approaches C asymp- 
totically. 

Conclusions.- We have derived the classical capacity 
of the lossy multimode bosonic channel when the average 



energy devoted to the transmission is bounded. Interest- 
ingly, quantum features of the signals (such as entangle- 
ment or squeezing) are not required to achieve capacity, 
because an optimal coherent-state encoding exists. At 
the decoding stage, however, quantum effects might still 
be necessary (e.g., in the form of joint measurements on 
the output) as standard homodyne and heterodyne mea- 
surements are not optimal, except for the high power 
regime where heterodyne detection is asymptotically op- 
timal. The focus of this paper has been the lossy chan- 
nel with minimal (vacuum-state) noise. A more general 
treatment would include non- vacuum noise, and would 
allow for amplification. 
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